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1 .  Introduction 

Let  C  be  a  circle  with  circumference  equal  to  one.  (For  notational 
convenience,  we  frequently  identify  C  with  an  interval  of  length  one  on  the 
real  line.)  For  probability  measures  on  C  ve  will  define  orderings  «  such 
that  |tj«|t2  means  roughly  that  p,  is  "more  uniform"  than  14  or  that  |i2  is 
"more  clumped"  than|it.  Distributional  or  derings  on  the  real  line  I  have 
proven  to  be  very  useful  in  many  areas  of  probability  and  statistics  and  it  is 
hoped  that  orderings  on  C  will  also  prove  to  be  of  some  use.  Later  in  this 
paper  we  shall  present  applications  of  the  orderings  «  to  the  following 
areas:  random  coverage  problems  on  the  circle,  the  distribution  of  spacings, 
and  the  analysis  of  directional  data.  In  this  section  we  shall  motivate  and 
define  the  orderings  and  present  some  of  their  basic  properties. 

To  define  the  orderings  we  need  the  following  notation.  Let  U  be  a 
random  variable  uniformly  distributed  on  C.  A  random  arc  on  C  having 
length  b  and  clockwise  endpoint  U  will  be  denoted  (U,U*b).  For  any 
probability  measure  (i  on  C  and  length  bc(0,l )  we  define  the  random 
variable  D(ji,b)-|i(lU,lM>)).  Please  note  that  throughout  the  paper  we  shall 
use  |i  (perhaps  with  a  subscript  or  superscript)  to  denote  a  probability 
measure  on  C. 

Our  orderings  have  the  form  iij«|L2  if  and  only  if  D(|i,.b)  is  "less 
variable"  than  D(jL2,b)  for  all  bt(0,l ).  To  motivate  this  type  of  definition  we 
note  first  that 

(1.1)  ED(|i,b)-b  for  all  |t  and  all  bc(0,l). 


□ 


(See  Section  5  for  a  proof  of  this;  All  proofs  are  given  in  Section  5.)  Now 
consider  the  two  extreme  cases:  let  X  be  the  uniform  distribution  on  C  and  6X 
be  the  degenerate  distribution  which  assigns  all  its  mass  to  an  arbitrary 
point  x  cC.  Clearly  DU.b)  -  b  with  probability  one  and 


w«,.b)-{  5 


with  probability  1-b, 
with  probability  b. 
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For  distributions  on  (0,1 1  with  mean  b,  these  are  the  two  extremes  with 
respect  to  variability.  This  suggests  using  the  "variability"  ofD(|i,b)to 
measure  the  "uniformity"  of  |t. 


Y  . 
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Using  two  different  notions  of  "variability"  we  obtain  the  orderings 

«  and  «  defined  as  follovs: 

▼  c 

|L,  «|i2  if  VarUX|tj,b))  i  Var(D(|i2.b))  for  all  bc(0,l). 

|ii  «  |i2  if  ^(Dfii^b))  <  EY(D(|4.b))  for  all  convex  functions  ¥ 
and  all  bc(O.l). 

The  letters  v  and  c  have  been  chosen  to  remind  us  of  the  words  'Variance" 
and  "convex".  Using  (1.1)  and  the  fact  that  ¥(i)-x2  is  oonvex  we  obtain 

(1.2)  |i|  «  |i2  implies  |i,  «|l2 

In  order  to  vindicate  the  above  definitions,  we  shall  now  list  some 

basic  properties  of  the  orderings.  These  properties  are,  for  the  most  part, 

ones  that  any  reasonable  "uniformity”  ordering  ought  to  possess.  The 

properties  hold  for  both  «  and  «  ,  so  they  are  stated  without  the  subscripts 

v  c 

v  and  c. 

(1.3)  X  «  (l  «  6X  for  all  )n.  Here  X  is  the  uniform  distribution  and  6X  is 
a  point  mass  at  i. 

(M)  |Lj  « |i2  and  y.2  « |i3  imply  |i,  « |l3 . 

Convolution  of  measures  on  C  can  be  defined  as  follows.  Identify  C 
with  the  interval  10,1 ).  If  X  and  Y  are  random  variables  with  probability 
measures  |l  and  i,  respectively,  then  the  convolution  is  the  probability 
measure  of  (X+Y)  modi  .  Intuitively  speaking,  the  operation  erf  convolution 
smooths  distributions  and  makes  them  more  uniform,  so  that  it  is  natural  to 
have 

(1.5)  |i*£ « I*  for  all  probability  measures  |i  and  on  C. 

in  this  paragraph,  we  shall  identify  C  with  the  interval  I- 1  /2,  1/2).  A 
probability  density  f  defined  on  C  is  said  to  be  symmetric  and  unimodal  if 
f(x)»f(-i)  for  all  x  and  f  is  nonincreasing  on  the  interval  10,1/2).  Let  |it  and 
|i2  have  densities  f(  and  f2  respectively.  If  f,  and  f2  are  symmetric  and 
unimodal,  and  f2  is  "more  peaked"  about  zero  than  f( ,  then  it  seems 
reasonable  to  say  that  |i(  is  more  uniform  than  |i2 ,  and  we  can  indeed  show 
the  following: 


(1.6) 


If  f,  is  symmetric  and  unimodal  and 

x  x 

J  f,(u)du  <  J  f2(u)du  Tor  0  <  i  <  1/2  .  then  |l,  « |i2  . 

-x  -* 

(It  turns  out  that  we  do  not  need  either  symmetry  or  unimodaiity  for  f2 ,  but 
this  is  not  very  intuitive.)  Here,  we  are  using  "more  peaked"  in  the  same 
sense  as  fiirnbaum  (1948). 

Finally,  ve  note  two  convexity  properties.  The  quantities  p,,pa,...,p 
are  nonnegative  and  sum  to  one. 

n 

(1.7)  Iffij  « |i  for  all  i,  then  ^PilM  <<  l1  • 

i-1 

n 

(1.8)  If  |i  «  (l-p)|t  +  PfLj  for  0  <  p  <  1  and  all  i,  then  |i  « 

i-1 


2,  Ordering  Discrete  Distributions 

In  this  section  we  give  explicit  conditions  which  allow  us  to  verify 
whether  or  not  the  orderings  «  and  <<  hold  for  some  simple  types  of  discrete 

distributions.  In  particular,  these  conditions  allow  us  to  compare  empirical 
distributions  so  that,  given  two  samples  each  containing  n  observations  on  C, 
we  can  determine  if  one  sample  is  more  uniformly  dispersed  than  the  other. 
For  future  convenience  we  define  8fl  to  be  the  set  of  vectors  (x,,x2 . xQ) 

Q 

satisfying  Jxj  -  1  and  it  >  0  for  all  i. 
i-l 

Suppose  a  measure  |t  has  n  atoms  each  having  a  mass  of  1/n .  that  is,  ]i 
is  the  empirical  distribution  of  a  sample  of  n  observations.  Then  |t  is 

determined  (up  to  a  rotation)  by  the  vector  x«(xlfx2 . xn)  of  spadngs 

between  consecutive  atoms  and  we  shall  write  jt-SP(x).  More  precisely,  for 

any  vector  x  c8n,  we  define  the  probability  measure  SP(x)  on  C  (identified 
with  10,1) )  by 


jO-l 

SWi)--5V  where  u„-0  and  u, 

U  ^  -  |  to  I 

i-0 


-  Jxk  for  I  <  i  <  n-1  . 
k-l 
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Now  consider  a  measure  |i  having  n  atoms  (of  varying  masses)  equally 
spaced  around  the  circle  C  so  that  consecutive  atoms  are  separated  by  the 
distance  1/n  .  This  measure  }i  is  determined  (up  to  a  rotation)  by  the  vector 
x-  (x,42.-.  .Ja)  of  probability  masses  for  the  consecutive  atoms  and  we  shall 

write  ji-PM(x).  More  precisely,  for  any  vector  x  c8Q,  we  shall  define  the 
probability  measure  PM(x)  on  C  by 

PM(x)  *  Xxi*i5i/a  • 
i*0 

The  names  SP  and  PM  are  intended  to  remind  us  of  "spadngs"  and 
probability  masses'  respectively.  The  measures  SP(x)  and  PM(x)  are 
inverses  or  duals  in  a  sense  to  be  developed  in  Section  3  so  that  by  the  result 
(31)  we  obtain 

(2.1 )  SP(x)  <<  SP(y)  if  and  only  if  PM(x)  «  PM(y) . 

We  now  present  a  condition  on  x  and  y  which  is  necessary  and 
sufficient  for  SP(x)  <c<  SP(y) .  This  condition  is  stated  in  terms  of  the 

majorization  ordering  for  vectors  which  is  treated  at  length  in  the  books  by 
Hardy,  Littlewood  and  Polya  (1952)  and  Marshall  and  Olkin  (1979).  For  any 
vector  (i,,i2 . xn),  let  X(,).x(2),...,i(n)  denote  the  coordinates  arranged  in 

increasing  order  i(1)  <  x(2)  <  ■••<  i(n)  •  If  two  vectors  x  and  y  in  RD  satisfy 

n  n  n  n 

Ixi  -  2Vi  2i(o  <  Iy(„  for  2  <  k  <  n  , 

i-i  i»l  i*k  i«k 

we  say  that  i  is  majorized  by  y  and  write  x  £  y  . 

For  any  vector  i-(i,,x2 . xn)c8Q  and  integer  k  with  0  <  k  <  n  we 

define  the  vector  x(k)  -  (x^.x^ . x(k))  by 

l(‘)-  Ixui  for  mU  L 
i-0 

Here  the  coordinates  of  x  are  considered  to  be  arranged  in  a  circle  so  that 
xn,j  *  Xj  for  all  i.  Note  that  x(,)-x,  x<0)-(0,0 . 0)  and  i(a)  -  (1,1 . 1).  Our 

condition  can  now  be  stated.  For  x  and  y  in  8D  , 


The  relationship  (2.1 )  becomes  false  upon  replacing  «  by  «  .  so  we 

c  y 

must  state  separate  conditions  for  SP(i)  <<  SP(y)  and  PM(z)  <<  PM(y) . 

These  are  now  given.  For  x  and  y  in  SQ. 

(2.3)  PM(x)  «  PM(y)  if  and  only  if 

o  n 

^(x^)2  L  for  >  >nd 

i«l  i«l 

(2.4)  SP(i)  «  SP(y)  if  and  only  if 

(x<,),x(2,...x(ft-|>)  £  (y(f),y(2) . yl0*1)) . 

Here  (x(l),x(2) . i^'1*)  denotes  the  vector  obtained  by  joining  (stringing 

together)  the  vectors  x(l),x(2).. ...i*0'1*  into  one  long  vector  having  n(n-l) 
components. 

It  is  straightforward  to  check  the  conditions  (2.2),  (2.3)  or  (2.4)  for 
any  given  x  and  y  in  8°,  so  these  conditions  are  useful  in  constructing 

counterexamples  like  the  following.  Let  x  -  (.3. 3. -2, .2)  and  y  -  (.55.0. .45,0). 

We  can  easily  verify  that  SP(x)  «  SP(y)  is  true,  but  that  both  SP(x)  «  SP(y) 

V  c 

and  PM(x)  «  PM(y)  are  false.  This  shows  that  the  orderings  «  and  «  are 

V  C  V 

in  fact  different  and  also  that  the  relation  (2.1 )  becomes  false  upon  replacing 

c  by  y  •  Similarly,  with  s  -  (.4,. 1,. 4,. 1)  and  t  -  (.55.15.15.15) .  we  find 

that  PM(s)  «  PM(t)  is  true,  but  that  SP(s)  «  SP(t)  is  false, 
y  y 

We  wish  to  note  one  more  counterexample.  Let 

|i,-SP(^l,2.3.2.1,3))  and  14  -  SP(^  1.2.3, 1.3.2)) . 

It  is  easy  to  verify  that  both  |i|  « |t2  and  |u  « (i,  are  true.  The  measures 

|tt  and  |i2  are  not  equivalent  in  any  obvious  sense;  neither  measure  can  be 
made  to  coincide  with  the  other  by  using  rotations  or  reflections  of  the  circle. 
Thus  <5  is  not  a  legitimate  partial  ordering;  it  satisfies  the  transitive 

property  (1.4),  but  not  the  anti-symmetry  property. 


For  any  measure  (i  we  define  an  inverse  (or  dual)  measure  |i*  as 
follows.  Identify  C  with  10,1 )  and  let  the  random  variable  U  be  uniform  on 
[0,1 ).  The  random  variable  X-ft(lO.Ul)  can  be  regarded  as  a  random  point  on 
C  so  long  as  0  and  1  are  taken  to  represent  the  same  point  on  C.  Define  |i* 
to  be  the  probability  measure  of  X.  If  |t  has  no  atoms  and  has  support  on  all 
of  C.  then  the  function  F(x)  -  |l(I0  jl)  is  invertible  and  F'd)  -  |t*(l0.xl)  for 
0 <x<  1  .  The  measure  jt*  depends  on  the  arbitrary  choioe  of  the  origin  (zero 

point)  on  C.  but  in  a  trivial  way.  Different  choices  of  the  origin  lead  to 
measures  n*  which  differ  only  by  a  rotation. 

If  the  measures  and  y>2  are  identical  or  can  be  made  so  by  rotating 
one  erf  the  measures,  we  shall  write  jij  ■  |i2  For  the  discrete  measures  in 
Section  2  it  is  easily  seen  that  p.  -  SP(y)  implies  |i*  •  PM(y)  and  similarly,  |i 
■  PM(y)  implies  |i*  ■  SP(y).  In  general,  |t**  ■  y.  for  all  |i. 

It  will  be  convenient  to  use  the  following  notation.  Let  X  and  Y  be 
bounded  random  variables  with  distributions  F  and  G  respectively.  If  Ei'(X) 
<  E’f'(Y)  for  all  convex  functions  ¥,  we  say  that  X  jJj  Y  or  equivalently  F  ^  G. 
This  type  of  ordering  is  well  known  and  has  many  uses  in  queueing  theory 
(see  Stoyan(  1983)).  The  ordering  is  closely  related  to  the  majorization 

ordering  £  for  vectors,  hence  the  similarity  in  the  notation. 

The  main  result  of  this  section  is  that  inversion  preserves  the  ordering 

«  ,  that  is. 
c 

(3  1)  Jt|  <5  and  only  if  <5 1*2  • 

If  we  restrict  ourselves  to  measures  |i  which  have  no  atoms  and  have 
support  on  all  of  C,  we  can  restate  (3.1 )  in  a  more  concrete  fashion.  Under 
this  restriction,  for  any  y  cC  and  t  c  (0,1 )  we  can  define  L(y,|t,t)  to  be  the 
length  of  the  unique  arc  A  on  C  which  has  clockwise  endpoint  y  and  satisfies 
(i(A)-t .  Let  Yj  be  distributed  according  to  |ij  for  i-1,2.  Then 

(3  2)  |t,  <c<|t2  if  and  only  if  L(Y|,|t,.t)^  L(Y2.|t2.t)  for  all  tc (0,1). 

The  relation  (31)  becomes  false  upon  replacing  «  by  «  .  See  the 
discrete  counterexamples  at  the  end  of  section  2.  However,  something 


resembling  (3.2)  can  be  shown  to  hold  for  the  ordering  «  .  For  any  points 

x/y  on  C,  define  d(i.y)  to  be  the  length  of  the  arc  [x,y]  having  clockwise 
endpoint  x  and  counterclockwise  endpoint  y.  When  x-y,  we  randomize  and 
set  dU,y)«0  or  1  with  each  value  having  probability  i/2.  Note  that 
d(x.y)*d(y,xM  unless  x-y.  Let  X,  ,Y,  be  independent  with  distribution  |l, 
and  X2,Y2  be  independent  with  distribution  14.  We  can  show  that 

(3  3)  it,  <y<  14  if  and  only  if  d(Xt.Y,)  j$  d(X2.Y2) . 

Specializing  this  to  discrete  measures  of  the  form  SP(x)  leads  to  the  result 
given  in  (2.4). 

^Applications 

This  section  contains  some  applications  of  the  uniformity  orderings  « 

and  << .  In  the  first  two  applications  the  orderings  are  used  to  give  precise 

versions  erf  some  fairly  intuitive  qualitative  statements. 

Suppose  n  arcs  having  arbitrary  lengths  b,.b2 . bn  are  placed 

uniformly  and  independently  on  C.  Let  A  denote  the  covered  region, 
n 

A  -  yiUj.Uj+bj) 

i-l 

where  U|,U2,...,Un  are  independent  and  uniformly  distributed  on  C. 
Intuition  suggests  that  if  |i,  is  "more  uniform"  than  |t2  ,  then  the  covered 
mass  |t|(«4.)  will  be  "less  variable"  than  14(A) .  We  can  indeed  show  that 

(4.1)  If  14  « 14 ,  then  Var(ii,(A))  <  Var(|t2(A)) . 

Note  that  when  n«l  this  statement  follows  immediately  from  the  definition 
erf  << .  Another  result  along  the  lines  of  (4.1)  is 

(4.2)  If  11,  -  14*14  for  some  measure  jl,  ,  then  14(A)  ^  14(A) . 

Remember  that  *  denotes  convolution.  The  condition  14  - 14*14  is  much 
stronger  than  i4«ji2  (see  result  (1.5)). 
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The  second  application  concerns  the  distribution  of  spacings  and 

higher  order  spacings.  Let  X1.X2 . Xn  be  i.i.d.  from  the  distribution  |i,  . 

Suppose  X(j),X(2) . X(Q)  are  the  points  Xt  arranged  in  counterclockwise 

order  beginning  with  Xq)  ■  Xj .  These  points  divide  C  into  n  segments 

whose  lengths  (called  spacings)  are  denoted  SltS2 . SQ .  To  be  precise  we 

take  Si-d(X(i),X(i4i))  with  d  as  in  (3  3)  and  X(ft4l)-X(|).  Similarly,  let 
T1.T2.....Tn  be  the  spacings  between  n  ii.d.  points  from  the  distribution  |t2 
The  "higher  order  spacings"  are  simply  sums  of  consecutive  spacings. 
The  properties  of  spacings  and  higher  order  spacings  have  been  extensively 
studied.  For  information  and  further  references  on  this  area  see  Pyke 
(1965).  Holst  (1979)  and  Guttorp  and  Lockhart  (1988). 

When  |i(  is  "more  uniform"  than  jt2  .  it  seems  intuitive  that  the 
spacings  (and  higher  order  spacings)  under  |i,  will  be  "less  variable"  than 
those  under  (i2 .  We  can  prove  that 

k  k 

(4.3)  If  |tt  «  f2  • then  Xsi  n  XTi  for  1  <  k  <  n-1  . 

i«l  i-l 

Our  final  area  of  application  is  the  analysis  of  directional  data.  A  basic 
problem  in  this  area  is  that  of  testing  for  uniformity:  Does  a  given  sample  of 
n  observations  look  like  it  was  obtained  by  random  sampling  from  the 
uniform  distribution  on  C  ?  If  the  alternatives  to  uniformity  have  no 
preferred  direction,  it  is  reasonable  to  use  test  procedures  which  are 
invariant  under  rotations  of  the  circle.  Such  an  invariant  procedure  can  be 

described  in  terms  of  a  test  statistic  +(S|,S2 . S„)  which  is  a  function  of  the 

consecutive  spacings  S1.S2 . Sn  between  the  n  data  points;  we  reject  the 

hypothesis  erf  uniformity  when  +  is  sufficiently  large.  The  book  by  Mardia 
(1972)  reviews  many  of  the  test  statistics  which  have  been  proposed  for 
testing  uniformity. 

What  functions  0  will  lead  to  reasonable  tests  for  uniformity? 
Functions  which  are  increasing  with  respect  to  «  or  «  are  obvious 

candidates.  Let  1  and  y  belong  to  8°.  A  function  +  will  be  called  weretsing 
if  4(x)<4(y)  whenever  SP(i)«SP(y).  Aprefiiof  c  <x  v  will  indicate 
whether  we  refer  to  <5  or  «  .  Dearly,  any  K-increasing  function  is  also 

c-increasing.  Using  the  results  (2.2)  and  (2.4)  we  can  easily  show  the 
following: 

(4.4)  If  #  is  a  Schur -convex  function  of  n  arguments,  then 
fa(i)  -  $(x(k))  is  a  c-increasing  function  for  1  <k  <n-l. 
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(4.5)  If  $  is  a  Schur-convex  function  of  n(n-l)  arguments,  then 
y(x)  -  d>(x(,,,x(2,,...,I(<,~,))  is  a  v- increasing  function. 

See  Marshall  and  Olkin  (1979)  for  information  on  Schur-convei  functions. 
Additional  r-increasing  functions  may  be  obtained  by  taking  various 

combinations  of  the  functions  in  (4.4);  If  y,,  y2 . va  are  ^increasing 

functions,  then  f ,♦  y2+ ...  +fB  and  max(y|,  y2 . fB)  are  ^increasing,  etc. 

We  note  that  many  of  the  proposed  test  statistics  are  indeed  either  c 
or  t'-increasing.  We  shall  use  the  nomenclature  of  chapter  7  of  Mardia 
(1972).  This  source  contains  definitions  for  all  the  statistics  mentioned 
below  except  for  the  scan  test  which  is  described  in  Naus  (1966)  and  Cressie 
(1977). 

The  following  statistics  are  all  ^-increasing:  Kuipers  test,  the  range 
test,  the  equal  spacings  test,  Hodges-Ajne’s  test  and  the  scan  test.  These 
statistics  are  all  closely  related  so  that  in  section  5  we  shall  discuss  only  two 
of  them  the  scan  test  and  Kuiper's  test. 

Cressie  (1976,  1979)  considers  test  statistics  of  the  form 

«i>  -  2  Mi1;1) 

i-I 

with  emphasis  on  the  cases  h(u)  -  -log  u  and  h(u)  -  u2.  According  to  (4.4),  a 
statistic  of  this  form  will  be  c- increasing  whenever  h  is  convex. 

Eiamples  of  ^increasing  statistics  can  be  found  in  the  general  class  of 
tests  for  uniformity  developed  by  Beran  (1968,  1969).  These  test  statistics 
can  all  be  rewritten  in  the  form 

o(n-l) 

Ba(x)  -  k  ♦  £  h(yj)  for  some  constant  k  and  function  h, 

i-l 

where  yj  are  the  components  of  the  vector  (x^i*2) . i<°-*)) . 

According  to  (4.5),  the  function  Ba  will  be  K-increasing  whenever  h  is 

convei.  Using  this  result  we  can  show  that  both  Watson's  test  and  Ajne's 

An  test  are  ^-increasing.  See  the  discussion  on  pages  190-192  of  Mardia's 
book. 

The  Rayleigh  test  is  an  example  of  a  test  statistic  which  is  not 
^increasing;  counterexamples  are  easy  to  construct.  This  is  to  be  expected 
because  the  Rayleigh  test  is  intended  only  for  use  in  detecting  unimodal 
alternatives  to  uniformity  and  is  not  sensitive  to  many  other  forms  of 
nonuniformity. 


5,  Proofs 


Proof  of  (1.1).  We  note  that 
E  IXfL.b)  -  |(Jl[x.x*b)d|t)dx 

where  I[x.x*b)  is  the  indicator  function  of  the  given  interval  and  both 
integrations  are  over  all  of  C  which  is  identified  with  10,1)  using  addition 
modulo  one.  Interchanging  the  order  of  integration  yields  (1.1). 

Proof  of  (1.2)  is  immediate. 

Proof  of  (1.3).  Because  of  (1.2),  it  suffices  to  prove  (1.3)  for  the 
ordering  <c< .  In  terms  of  the  ordering  j$  introduced  in  Section  3,  we  must 

show  that  DU,b)  jSj  D(n.,b)  ^  D(8x,b)  for  all  and  b.  The  extremal 
distributions  of  the  ordering  are  well  known  (see  Example  1.9(b),  page  23 

of  Stoyan  (1983) )  so  that  this  result  follows  from  (1.1)  and  the 
accompanying  discussion 

Proof  of  ( 1 .4)  is  immediate. 

Proof  of  ( 1 .5).  Again,  by  ( 1 .2)  it  suffices  to  consider  only  «  .  To 

simplify  the  presentation  we  carry  out  the  proof  only  for  measures  £  which 
are  discrete  with  finitely  many  atoms.  In  this  case  (1.5)  follows  easily  from 
(1.7)  which  is  proved  later.  Suppose  £  has  atoms  of  mass  p,  at  the  points  Xj 
for  1  <  i  <  n.  Define  the  measure  m  by  |ij(B  +  Xj)  ■  |t(B)  for  all  measurable  B 

contained  in  C.  Then  |i*£  -  <c<  |i  for  all  i.  Now  using  (1.7) 

completes  the  proof 

Proof  <V  (1.6).  Again,  it  suffices  to  consider  only  « .  We  shall  first 
restate  the  definition  erf  «  in  a  slightly  different  form.  For  the  measures 
and  |i2  and  8  c  (0,1/2)  we  define  functions  fj  ,  by  fi4(x)  -  |tj(lx-8,x+8]).  Now 

(5.1)  ji,<c<|t2  if  and  only  if  f,  #(U)  £  f2#(U)  for  aU  8  e  (0,1/2) 

where  U  is  uniformly  distributed  on  C. 

Using  the  assumptions  in  (1.6),  it  is  straightforward  to  show  for  all 
8  e  (0,1/2)  that  f,  8  is  symmetric  and  unimodal  and 
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J  f18(u)du  <  J  f2,(u)du  for  0<x<  1/2. 

-x  '* 

For  any  function  g  on  C,  let  g  denote  its  decreasing  rearrangement.  With 
m(y)  equal  to  the  Lebesgue  measure  of  the  set  (i :  g(x)  >  y) ,  ve  can  define 
g(u)«sup(y.  m(y)  >  u)  for  0  <  u  <  1.  Clearly 

2x  x  x  2x 

[  f,,(u)du-  f  f,  j(u)du  <  J  f28(u)du<  [  f2a(u)du 
0  -x  -*  lo 

for  0  <  x  <  1/2  .  Therefore,  using  a  result  of  Hardy,  Littlewood  and  Polya 
(see  page  1 5  of  Marshall  and  Olkin  ( 1 979) ).  ve  conclude  that 
f ,  ,(U)  f2>l(U)  as  desired. 

Proof  of  (1.7).  We  shall  consider  first  the  ordering  «  .  Fii  a  value  of 

c 

b  and  let  Xj  -  D(fijtb)  for  1  <  i  <  n  and  Y  -  W|i,b).  Then 

DfXiPjHj  .bJ-XjPjXi. 

Let  ¥  be  any  convex  function.  We  know  that  iMXj)  <  ET(Y)  for  all  i 
because  fij « |t .  Jensen  s  inequality  now  yields 

^(XiPjXi)  <  HHK(V) 

which  completes  the  proof.  The  argument  for  «  is  the  same  except  that  we 

need  consider  only  the  function  ¥ (x)  -  x2 . 

Proof  of  (1.8).  The  following  fact  is  needed  in  the  proof.  Let  W  and  Z 
be  arbitrary  bounded  random  variables  and  ¥  be  any  convex  function 
having  a  continuous  second  derivative. 

(52)  E¥(W)  <  E¥((l-p)W  ♦  PZ)  for  aU  pc  10,1)  if  and  only  if 

El(Z-W)¥'(W)]  >  0  . 

This  fact  is  proved  by  taking  expectations  in  the  inequalities 

¥(W)  ♦  p¥'(W)(Z-W)  <  ¥((l-p)W  ♦  PZ)  <  ¥(W)  ♦  p¥'(W)(Z-W)  *Kp2  . 

The  first  inequality  is  a  consequence  of  the  conveiity  of  ¥.  The  second 
inequality  is  just  the  Taylor  series  with  the  remainder  term  replaoed  by  an 
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upper  bound.  Let  B  denote  the  bounded  set  in  which  W  and  Z  take  values. 
We  can  choose  K  to  be  any  value  greater  than  \  sup(  Y~(u) :  u  e  B  ) . 

We  shall  prove  (1.8)  only  for  the  ordering  «  .  (The  argument  for  «  is 

c  v 

identical  except  that  we  consider  ooiy  the  function  T(x)  -  x2  .)  Let  X|  and  Y 
be  as  in  the  proof  of  (1.7).  We  must  show  that  EY(Y)  <  EY(2iPjXj)  for  all 

convex  functions  Y.  It  suffices  to  verify  this  for  smooth  convex  functions 
(having  continuous  second  derivatives)  because  any  convex  function  can  be 
approximated  arbitrarily  well  by  a  smooth  one.  Using  (5.2)  and 

for  0<p<l  we  obtain  EKXrY)Y'(Y)l  >  0  for  alii.  Thus 
E[((IiP|Xi)-Y)Y'(Y)j  >0 

and  applying  (5.2)  leads  to  the  desired  conclusion. 

Proof  of  (2.1)  is  im  mediate  from  result  (3.1)  proved  later. 

Proof  of  (2.2).  We  need  some  preliminaries.  For  any  vector  w  - 

(w,.w2 . wQ)  we  define  F(w)  to  be  the  distribution  on  R  which  places  mass 

1  /n  at  each  point  Wj  so  that 

(F(w))(t)  -  I(Wjit) 
i-1 

It  is  well  known  (see  page  17  of  Marshall  and  Olkin  (1979))  that 

(5  3)  v  £  w  if  and  only  if  F(v)  £  F(w) . 

Now  to  the  proof.  We  shall  use  (2.1 )  and  show  instead  that 

PM(i)<<PM(y)  if  and  only  if  £  y(fc)  for  1  <  k  <  n-1.  We  shall  use  £  to 
denote  equality  in  distribution.  It  is  easy  to  see  that 

(5.4)  D(PM(x).k/n)  £  F(x<k>)  for  0  <  k  <  n. 

Using  (5.3)  and  (5.4)  leads  to 

D(PM(x).k/n)  £  D(PM(y),k/n)  for  all  k  if  and  only  if  x(k)  £  y(k)  for  all  k. 
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It  remains  to  show  that  x(k)  £y(k)  for  all  k  implies 

D(PM(i),b)  j!j  D(PM(y).b)  when  b  *  i/n  for  any  integer  i.  This  follows  by 

observing  that 

(5  5)  If  b  -  A(i/n) ♦  ( 1  -A)((i+ 1  )/n)  with  0  <X  <  1  and  0  <  i  <  n-1, 
then  D(PM(x).b)  £  JtP(x<»Ml-Jl)F(x<‘*'>) . 

Proof  of  (2.3).  Using  (1.1)  we  see  that  PM(x)<<PM(y)  if  and  only  if 
ED2(PM(x),b)  <  ED^CPMIyJ.b)  for  all  b.  From  (5.4)  we  obtain 

a 

ED2(PM(x),k/n)  -  • 

i-1 

Combining  this  with  (5  5)  leads  easily  to  the  proof. 

Proof  of  (2.4).  This  result  follows  from  (33)  after  noting  that  if  X  and 

Y  are  independent  random  variables  on  C  with  the  same  distribution  |i,  then 
the  distribution  of  d(X,Y)  is 

~F((x(,).x<2> x(o- 1 )})  ~ F((0,1 )) . 


Proof  of  (3  1).  We  shall  need  some  elementary  facts  concerning  the 
ordering  j!j .  If  the  random  variables  X  and  Y  satisfy  EX  -  EY  then. 

(5.6)  X  £  Y  if  and  only  if  E(X-s),  <  E(Y-s)t  for  all  s. 

and  similarly 

X  jSj  Y  if  and  only  if  E(s-X),  <  B(s-Y)4  for  all  s. 

See  Stoyan(1983),  pages  8-12.  Here  we  have  used  the  notation  (z)4  - 
mai(z.O) 

The  result  (31)  follows  immediately  from  the  formula 

(5.7)  BHXmM),  -  E(s-D(|i*.t))4  . 

The  argument  (using  (5.6))  is 

h<c<H 

iff  EdX|i,.s)-t)4  <  BdX|t2,s)-t),  for  all  s  and  tin (0.1) 


iff  E(s-D(|t*.t))+  <  Els-IX^.t)).  for  all  sand  tin (0,1) 

iff  <c<  m 

We  now  prove  (5.7).  Let  k  denote  convergence  in  distribution.  For 

distributions  on  C  this  has  the  usual  meaning;  (t#  i  |i  if  |tB(I)  -*  |i(I)  for  all 
intervals  I  on  C  whose  endpoints  are  not  atoms  of  |t.  It  is  straightforward  to 

check  that  m,  h  |i  implies  |t*  i  ft*  and  D(|ift.s)  h  D(|i,s)  for  all  s.  Thus 
both  sides  of  equation  (5.7)  are  continuous  with  respect  to  convergence  in 
distribution.  This  implies  that,  in  proving  (5-7),  it  suffices  to  consider 
measures  which  have  no  atoms  and  have  support  on  all  of  C  ;  any 
probability  measure  on  C  can  be  approximated  arbitrarily  well  by  a  member 
of  this  class.  In  the  argument  which  follows  we  restrict  p  to  be  a  member  of 
this  class. 

Identify  C  with  (0,1).  Define  F  and  F*  by  F(i)  -  |i((0,x])  and  F'(x)  - 
ji*(I0,xl)  for  all  x  e  (0,1 ).  The  assumption  on  p  implies  that  F  is  invertible  so 
that  F-1  -  F\  We  shall  regard  F  and  F*  as  one-to-one  mappings  from  C  onto 
itself.  Let  V  and  W  be  independent  random  variables  uniformly  distributed 
on  C  and  consider  the  random  arcs  fV,V*sl  and  (W,W»tl.  Since  F“*  -  F*  we 
have 

(5.8)  P{  [F(V),F(V*s)I  d  (W.WHl }  -  P{  [V,V*s]  d  (F*(W).F*(Wu)l ) . 

This  formula  is  in  fact  true  for  all  probability  measures  p,  but  the  argument 
requires  more  care. 

Consider  the  left  hand  side  of  (5  8)  and  condition  on  the  value  of  V. 
Clearly  P{  lF(V),F(V*s)j  d  IW,W*tl  |  V)  -  (L-t).  where  L  is  the  length  of  the 
interval  lF(V).F(V*s)l.  Note  that  L  -  |t(IV,V*s))  -  D(|i,s).  This  leads  to 

P{  !F(V),F(V*s)]  d  (W,W*t) }  -  E(D(|i,s)-t)« . 

If  we  condition  on  the  value  of  W  in  the  right  hand  side  of  (5.8)  and  use  a 
similar  argument  we  find  that 

P{  IV.V+s]  d  IF*(W),F*(WH)1 }  -  B(s-D(|i*.t))*  • 

This  completes  the  proof. 

Proof  of  (3.2).  Let  us  reexamine  the  right  hand  side  of  (58).  Define  Y 

-  F’(W).  Note  that  Y  is  distributed  according  to  p  and  that  |i((F*(W),F*(W+t)I) 

-  t.  Thus,  by  the  definition  which  precedes  (3  2),  the  length  of 
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|F*(W),F*(W+t)l  is  L(Y,|i,t).  This  means  that  ine  right  hand  side  of  (58) 
equals  E(s-L(Y.ji,t))4  so  that  (5.7)  can  be  rewritten  as 

E(D(|t,s)-t)+  -  Rs-UY.jt.t))*  where  Y  is  distributed  according  to  |i. 

The  proof  is  now  completed  by  applying  (5-6)  as  in  the  proof  of  (3-1  )- 

Proof  of  (3-3)-  Suppose  that  X  and  Y  are  independent  with  the  same 
distribution  |L  We  first  note  that  Ed(X.Y)-l/2.  This  is  true  even  when  |t  has 
atoms,  but  in  this  case  our  convention  regarding  ties  (d(v)-  0  or  1  with 

equal  probability)  is  crucial.  Ed(X,Y)«l/2  is  a  consequence  of  d(X,Y)  t  d(Y,X) 
and  d(X,Y)*d(Y,X)«l  whenX^Y. 

Neit  we  note  that 

(5  9)  ED2(jt,s)-2E(s-d(X,Y))4  for  all  ft . 

We  shall  prove  this  only  for  |i  which  have  no  atoms  and  have  support  on  all 
of  C.  (The  proof  for  general  1 1  can  then  be  obtained  by  a  limiting  argument.) 

Go  back  to  (5.8)  and  replace  t  by  a  random  variable  U  which  is  uniformly 
distributed  on  (0,1 )  and  independent  of  V  and  W.  Now  W  and  W*U  are 

independent  and  uniformly  distributed  on  C  so  that  F*(W)  and  F*(W*U)  are 
independent  with  the  distribution  |t.  The  length  of  IF*(W),F*(W+U)1  is  thus 
d(X,Y)with  X-F*(W)  and  Y  -  F*(W+U).  Repeating  the  argument  in  the 
proof  of  (5-7)  now  yields  EdXji,s)-U)4  -  E(s-d(X,Y))4 .  Because 

1 

J  (i-u)4du  -  i2/2  for  0  <  i  <  1 
0 

we  have  E(D(|ji.s)-U)4  -  (ED2(|i,s))/2  and  the  proof  of  (5.9)  is  complete. 

Using  (5.9)  we  see  that  ED2(^t,s)  <  BD^.s)  for  all  s  if  and  only  if 
E(s-d(X,,Y,))4  <  E(s-d(X2,Y2))4  for  all  s.  Now  (1.1)  and  (5-6)  lead  us 
immediately  to  (3.3). 

Proof  of  (4.1 ).  First  note  that  Eji,(^)  -  E|t2(*4.).  This  follows  from 
Fubini's  theorem  which  gives 

E|t(**)-  Jp(ic*)d|i(x) 

C 
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and  the  fact  that  the  distribution  of  the  random  set  A  is  rotationally 
invariant.  From  the  equality  of  means  we  see  immediately  that 
Var(|t,(A))<Var(}4(^))  if  and  only  if  EO-fi,!**))2  <  Ed-^U*))2. 

Using  l-|i(A)  -  |t(«4.c)  (with  c  denoting  complement)  and  Fubini  s 
theorem  we  obtain 


(510) 


Ed-itUl))2 - 


I  P(i  zAc, y  C^c}d|t(x)d|t(y) . 

:c 


Define  Bi  -  lUj.U^bj)  so  that  A  is  the  union  of  B1.B2 . Bn.  The  intervals 

Bi  are  independent  so  that 


P(xe^c,y  c*4c) 


J"Jp(xcB[,y  eBf) 

i-i 


where  it  is  easily  seen  that 

PUcB^yeB^)  -  (d(x.y)-bj).  ♦  (l-dd.yl-bj), . 


Thus  (5.10)  can  be  rewritten  as  Ed-pUl))2  -  E*F(d(X.Y))  where  X  and  Y  are 
independent  with  distribution  (i  and 

¥(z)-ni(z-bi)4-(l-z-bi)J. 

i-1 

It  is  straightforward  to  show  that  Y  is  convex  so  that  (4.1 )  now  follows  from 
result  (3.3) 

Proof  of  (4.2).  To  simplify  the  presentation  we  carry  out  the  proof 
only  for  measures  |t3  which  are  discrete  with  finitely  many  atoms.  Suppose 
|i3  has  atoms  of  mass  p4  at  the  points  x,  for  1  <  i  <  n.  Then 


so  that 
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y(|L|(«4))  < 

holds  for  any  convex  function  Y.  Now  taking  expectations  and  using  the 
rotational  invariance  of  the  distribution  of  A  leads  to  EY((l,(«4))  <EY(|4(^l)) 
as  desired. 

Proof  of  (4.3).  We  shall  prove  this  only  for  |t|  which  have  no  atoms 

and  have  support  on  all  of  C.  Let  U,,U2 . U0  be  ild.  uniform  on  C. 

Suppose  U(i).U(2) . U(n)  are  the  points  Uj  arranged  in  counterclockwise 

order  beginning  with  Uj-U(1) .  Identify  C  with  10,1 )  and  define  F,  by  F,(x)  - 
(ijdO.x))  for  i  -  1,2.  Now  define  the  dual  meaures  |t*  and  onC  by 

|L*([0,x))  -  Fj*(x).  The  points  Fi,(U|),Fjl(U2),...,Fil(Ull)  are  i.i.d.  from  the 
distribution  ^  so  that  we  have 

is,  i  n;(IU(i).U(k.„))  «nd  IT,  i  H5(iua).u(k.„)) 
i*l  i-1 

Let  fi  -  IU(|)  Conditional  on  its  length,  B  is  just  a  uniformly  placed 

random  arc  on  C  so  that  jiJ(B)  ^  jij(B)  follows  immediately  from  (3. 1 ).  This 
completes  the  proof. 

Statistics  for  testing  uniformity.  We  now  show  that  the  scan  statistic 
and  Kuiper's  statistic  are  c-increasing.  Let  s  -  (s,,s2 . sn)  be  the  vector  of 

consecutive  spacings  between  n  points  on  C  and  let  N(i,b)  denote  the 
number  of  points  on  C  contained  in  the  interval  (x,x+b).  The  scan  statistic 

N(b)  having  width  parameter  b  is  defined  as  N(b)  -  sup  N(x,b)  with  i 

z 

ranging  over  C.  The  scan  statistic  has  an  obvious  relationship  with  the 
higher  order  spacings; 

N(b)>k+1  if  and  only  if  mins^<b 

i  ' 

if  and  only  if  max  s(?"k*  >  1-b  . 
i  1 

The  function  #(x,,x2 . xn)  -  max  x,  is  Schur-convex  so  that  (4.4)  implies 

the  function  #s)  -  max  is  o increasing  which  in  turn  implies  that  the 
scan  statistic  is  ^-increasing 


Nov  we  consider  Kuiper  s  test.  Choose  an  arbitrary  zero  point  on  C 

and  identify  C  with  the  interval  [0,1 ).  Let  y(j).y(2) . Y(B)  be  the  order 

statistics  of  the  n  points  in  our  sample.  Luiper's  test  statistic  VA  may  be 
computed  using  the  formula 

Vfl  -  i*  tnu  (yU)  -  i )  -  min  (y(l)  -  i )  • 

See  page  174  of  Mardia(1972).  By  arguing  as  in  Section  4  of  Cressie  (1977), 
it  is  straightforward  to  reeipress  Kuipers  test  in  terms  of  the  spadngs  as 

(5.11)  Vn  -  ~  ♦  max  {  max  (s-^  )^  } 

where  s-~  denotes  the  vector  (s,  .  *2'n  ‘a  ^  The  function  inside 

braces  in  (5.1 1 )  is  ^-increasing  by  (4.4).  Now,  since  the  maximum  of 
^increasing  functions  is  again  o  increasing,  we  see  that  Vn  is  c- increasing  as 
desired. 
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